Placement of parameter server in wide area network topology for geo-distributed machine learning

نویسندگان

چکیده

Machine learning (ML) is extensively used in a wide range of real-world applications that require data all around world to pursue high accuracy global model. Unfortunately, it impossible transmit gathered raw central center for training due privacy, sovereignty and communication cost. This brings the idea geo-distributed machine (Geo-DML), which completes ML model across multiple centers with bottleneck cost over limited area networks (WAN) bandwidth. In this paper, we study on problem parameter server (PS) placement PS architecture efficiency Geo-DML. Our optimization aims select an appropriate as algorithm based We prove NP-hard. Further, develop approximation solve using randomized rounding method. order validate performance our proposed algorithm, conduct large-scale simulations, simulation results two typical carrier network topologies show can reduce up 61.78% B4 topology 21.78% Internet2 topology.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parameter Server for Distributed Machine Learning

We propose a parameter server framework to solve distributed machine learning problems. Both data and workload are distributed into client nodes, while server nodes maintain globally shared parameters, which are represented as sparse vectors and matrices. The framework manages asynchronous data communications between clients and servers. Flexible consistency models, elastic scalability and faul...

متن کامل

Towards Geo-Distributed Machine Learning

Latency to end-users and regulatory requirements push large companies to build data centers all around the world. The resulting data is “born” geographically distributed. On the other hand, many machine learning applications require a global view of such data in order to achieve the best results. These types of applications form a new class of learning problems, which we call Geo-Distributed Ma...

متن کامل

Communication Efficient Distributed Machine Learning with the Parameter Server

This paper describes a third-generation parameter server framework for distributed machine learning. This framework offers two relaxations to balance system performance and algorithm efficiency. We propose a new algorithm that takes advantage of this framework to solve non-convex non-smooth problems with convergence guarantees. We present an in-depth analysis of two large scale machine learning...

متن کامل

Support for Object Placement in Wide Area Heterogeneous Distributed Systems

One of the open challenges in distributed computing systems is determining how to place tasks onto processors when they are needed (in the Legion project being developed at UVA the basic computational units are modelled as objects, so the problem is one of object placement). The placement decision is crucial because it determines the run-time behavior of an object, including performance, cost a...

متن کامل

Server-assisted Latency Management for Wide-area Distributed Systems

Recently many Internet services employ wide-area platforms to improve the end-user experience in the WAN. To maintain close control over their remote nodes, the wide-area systems require low-latency dissemination of new updates for system configurations, customer requirements, and task lists at runtime. However, we observe that existing data transfer systems focus on resource efficiency for ope...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Communications and Networks

سال: 2023

ISSN: ['1976-5541', '1229-2370']

DOI: https://doi.org/10.23919/jcn.2023.000021